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Abstract 



One of the most widely used methods for eigenvalue computation is the 
QR iteration with Wilkinson's shift: here the shift s is the eigenvalue of 

Q, ' the bottom 2x2 principal minor closest to the corner entry. It has been 

^O . a long-standing conjecture that the rate of convergence of the algorithm is 

cubic. In contrast, we show that there exist matrices for which the rate of 
convergence is strictly quadratic. More precisely, let Tx be the 3x3 matrix 
C^ ■ having only two nonzero entries {Tx)i_2 = {Tx)2i = 1 and let 7a be the set 

of real, symmetric tridiagonal matrices with the same spectrum as Tx- There 
exists a neighborhood lA <Z Ta oi Tx which is invariant under Wilkinson's 
shift strategy with the following properties. For To £ U, the sequence of 

^^ ' iterates (Tfe) exhibits either strictly quadratic or strictly cubic convergence to 

zero of the entry (rfc)23. In fact, quadratic convergence occurs exactly when 
lim Tk — Tx ■ Let X be the union of such quadratically convergent sequences 
(Tk): the set X has Hausdorff dimension 1 and is a union of disjoint arcs X" 

f~^ . meeting at Tx, where a ranges over a Cantor set. 

[~^ ■ Keywords: Wilkinson's shift, asymptotic convergence rates, symbolic dynamics. 
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1 Introduction 



The QR iteration is a standard algorithm to compute eigenvalues of matrices in 
C^ ' T, the vector space of n x n real symmetric tridiagonal matrices ([H], [Ij, [T5]). 

More precisely, consider T £ T and a shift s G K so that T — si is an invertible 
matrix. Given the QR decomposition T — si = QR, where Q is orthogonal and 
R is upper triangular with positive diagonal, the shifted step obtains a new matrix 
F(s,T) = Q*TQ = RTR-^. Let a;_(r) < a;+(T) be the eigenvalues of the bottom 
2x2 principal minor of T and let w(T) be the eigenvalue closer to the bottom 
entry {T)nn- Wilkinson's shift is the choice s — ijj{T) and Wilkinson's step is 
W(T) — F{u){T),T), provided u;(T) is well defined and is not an eigenvalue of T. 

A matrix T g T is unreduced if (r)i.i_|_i — (T)i_|_i .; ^ for 1 < i < n. Recall that 
if To is unreduced and the iterates Tk = W'''(ro), fc e N, are well defined then Tk is 
also unreduced and the bottom off-diagonal entry {Tk)n-i,n tends to 0. The quick 
convergence of Wilkinson's algorithm is a well known fact, as discussed in Section 
8-11 of [13]: we are interested in the precise rate of convergence of this sequence. 
It has been conjectured ([13], [1]) that it should be cubic, i.e., \{Tk+i)n,n-i\ = 
0{\{Tk)n,n-i\'^)- Here, instead, we show that cubic convergence does not hold in 
general: there are unreduced matrices Tq for which the rate of convergence of the 
sequence {Tk)23 to is, in the words of Parlett, merely quadratic. 

Given T G T, the above formulae imply that W(T), if well defined, is symmetric 
and upper Hessenberg and therefore T and W(T) are matrices in T with the same 



spectrum. For A — diag(l, —1,0), denote by TX the set of matrices in T similar to 
A and consider W as a map from 7a to 7a. There are some technical aspects to 
consider. First, there are matrices T G 7a with bottom entry T3.3 equidistant from 
a;_ and a;+. This introduces a step discontinuity in the map W: when T tends 
to such a matrix Tq, W(r) may approach either F(a;_(ro), To) or F(u;+(ro), Tq)- 
Also, strictly speaking, the definition of W does not apply to matrices T for which 
^23 — because then a;(T) is an eigenvalue of T. It turns out, however, that W 
can be continuously extended to such matrices. 

We introduce the notation required to state the main result of this paper. Let 

/O 1 0\ 

Tx=\\ e Ta. 

\0 0/ 

An infinite sign sequence is a function ct : N — » {+,—}■ Let S be the set of 
infinite sign sequences: S admits the natural metric d(ai, 02) = 2~" where n is the 
smallest number for which ai{n) ^ a2{n). The metric space E is homeomorphic to 
the middle-third Cantor set contained in [0, 1]. For a G S, let ct" G S be obtained 
by deleting the first sign, i.e., (j^{n) = a{n + 1). 

Theorem 1.1 There is an open neighborhood VI d Ta of Tx with the following 
properties. 

(a) IfT eU then W(r) € U. 

(h) IfTo Qti then the sequence Tk = W'^(To) converges to Too with (Too) 23 — 0- If 
Too = Tx then the convergence rate of (Tk) 23 to is strictly quadratic; otherwise 
it is strictly cubic. 

(c) Let X d 14 be the set of initial conditions Tq for which T^o — Tx ■ Then X is 
the union of arcs X'^ , cr e E, which are disjoint except for the common point 
Tx. Also, WiX") CX"'. 

(d) The set X dU has Hausdorff dimension 1 . 

We opted to make this paper as self-contained as reasonably possible at the 
expense of rendering several sections more technical. The inductive arguments 
employed to control the iteration make use of an assortment of explicit estimates. 
The parameters obtained are loose and might be replaced by a chain of existential 
arguments: we hope that our choice led to a clearer presentation. 

In Section 2 we introduce an explicit chart (/) : M^ ^ 7a, satisfying (f>{px) = Tx 
where px = (2,0). Shifted steps in these (x,?/) coordinates, i.e., the functions 
F{s,x,y) — (/)^"'^(F(s, (/)(x, y))), admit a simple formula: 

fl + s \s\ 

F{s, x,y) ^ x, ——y 

\ 1 — s 1 + s 

One may think of (x, y) coordinates as a variation of spectral data in the sense of 
[2] , [3] and [7j : eigenvalues and the absolute values of the first coordinates of the 
normalized eigenvectors. The map (f) is an instance of bidiagonal coordinates ([8]), 
which form an atlas of 7a: is a chart whose image is an open dense set containing 
Tx- In this chart, convergence issues reduce to local theory. The present text might 
serve as an illustration of the general construction: the downside is that the formula 
for (f) is presented without a natural process leading to it. 

The rather technical Section 3 is dedicated to the study of the sub-eigenvalues 
u}±{T) and their counterparts in {x, y) coordinates uj± — ijJ± o (j). We also introduce 



the rectangle TZa = [2 — a, 2 + a] x [—a/10, a/10], < a < 1/10, whose image 
TZa = 4'i'R-a) is the closure of the invariant neighborhood lA in Theorem ll.il The 
discontinuities of w lie on the vertical line x = 2: if a; < 2 (resp. x > 2), uj(x, y) — 
'^+(x,y) (resp. uj(x,y)=uj^{x,y)). 

In Section 4 we show that TZa is invariant under W = 0~^oWo0 (for sufhciently 
small a). The discontinuous map W has smooth restrictions to the interior of the 
rectangles 7^a,+ = [2 - a, 2] x [-a/10, a/10] and TZa,- = [2, 2 + a] x [-a/10, a/10] 
which extend continuously to W± : TZa.± — > TZa- 

The rest of the proof of Theorem 11.11 proceeds by studying the iterations of 
W. At this point, the vocabulary and techniques from dynamical systems are 
natural (another application of dynamical systems to numerical spectral theory is 
the work of Batterson and Smillie [1] on the Rayleigh quotient iteration). In Section 
5 we provide different characterizations of the set X — (f)^^{X) and show its non- 
triviality. 

In Section 6 we construct a continuous map from [—a/10, a/10] x S — > A" taking 
a pair (y,cr) to the only point po = {g'^{y),y) e A" for which p^ = W'^ipo) e TZa^a{k)- 
Informally, the sign sequence a specifies the side of the rectangle TZa in which pk 
lies. This yields the decomposition of A" as a union of the Lipschitz arcs X"' , cr € E. 
Sharper estimates concerning g'^{y) are needed to prove that X is very thin, i.e., 
that its Hausdorff dimension is 1. The concept of Hausdorff dimension is only used 
at the very end of the argument in order to obtain a more concise formulation of 
an estimate on the number of balls of radius r required to cover X . 

2 Local coordinates and s-steps 

The QR decomposition of an invertible matrix M is M = Q(M)R(Af), where 
Q(M) is orthogonal and R(Af ) is upper triangular with nonnegative diagonal. The 
s-step with shift s is the map 

F(s, M) = (Q(M - si))* M Q(M - si) (!) 

= R(Af-s/)M(R(M-s/))"^ (2) 

This is well defined provided M — si is invertible. 
As an example, to be used in the sequel, consider 



Tx 



Clearly, the matrix Tx — si is invertible if s ^ 0, ±1. For s ^ 0, |s| < 1, the QR 
decomposition of Tx — si is 
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QiTx - si) Il{Tx - si) = 



where si = (1 + s^)^^/^. Notice that Q(r — si) has a jump discontinuity at s = 
while R(T;t' — si) is continuously defined but ceases to be invertible at s = 0. 
Standard s-steps for Tx are given by 



F(s,T;t) = Y^^ I 1--^ 2s 



-2s 1 - s^ 0> 
- s2 2s 
0; 



Notice that this formula extends continuously to s = with F(0, T^") = Tx- Such 
continuous extensions will be important throughout the paper. 



Let T be the set of 3 x 3 real, symmetric, tridiagonal matrices. Let A = 
diag(l,— 1,0) and 7a = {Q*h.Q,Q e 0{n)} be the set of matrices in T similar 
to A. In fact, 7a is a bitorus ([II], [H]) but this kind of global information will not 
be used in this paper. 

We now define the relevant parametrization of 7a near Tx- Let £ be the set 
of signature matrices^ i.e., diagonal n x n matrices with diagonal entries equal to 
±1. An invertible matrix M is LU -positive if there exist lower and upper triangular 
matrices L and U with positive diagonals so that M = LU; equivalently, M is 
LC/-positive if its leading principal minors have positive determinant. For instance, 
the only LC/-positivc orthogonal matrix Q with Tx = Q*^Q is 

1/V2 l/\/2 0^ 
Q^Qx= I -l/\/2 1/V2 

1; 

Proposition 2.1 Set px = (2, 0) and (j):R'^ -^T, 

'{4 - x'^)rj 2xT% 

c, y) = -^^ \ 1xr\ -4(4 - x^ - Ax'^y'^ + x^^y^) 2yrl 

2yr\ y^(a;^ — 4)r\. 




where ri = ^4 + x^ + Ax'^y'^ and r2 = •\/4 + 4y^ + x^y'^ . Then (t){px) — Tx and 
there exist neighborhoods Ui C M^ of px and Bi C T of Tx such that 4)\ui is 
an immersion with image Ui = T\ f] Bi. Also, sign(a;) = sign(((/)(a;, y))i2) and 
sign{y) = sign(((^(x, j/))23). Finally, for matrices T = (f){x,y) G Ui, the only LU- 
positive orthogonal matrix for which T = Q*AQ is 

2a;(l + 2y^) xyri 

2(2 + a;2y2) _2j/ri 

y(4-x2) 2ri 

Since we are interested in the behavior of Wilkinson's step near Tx, we succes- 
sively define nested neighborhoods Uk of Tx and Uk oi px'- the process stops with 
the contruction of the rectangle TZa at Proposition 13.31 Constructions on matrices 
use boldface symbols. 

Proof: The example above obtains Tx = <P{px)- The remaining statements also 
follow from numerical checks, but instead we provide motivation for the construction 
of the map (j). Given the eigenvalue matrix A, any matrix T G 7a admits an 
orthogonal diagonalization T — Q*AQ, which is not unique: all other orthogonal 
diagonalizations are of the form T = Q*EAEQ for some signature matrix E ^ £. 

Now, the rows of Q are normal eigenvectors of T, so its columns qi , 92 and (73 
are also orthonormal. For vectors q and w, we write g ~ w to indicate that they are 
collinear. Say qi ^ wi — (2, —x, —2xy)*. It is well known that simple eigenvalues 
and their normalized eigenvectors vary smoothly with the related matrix; thus, for 
T e 7a, T near Tx, the first column of Q can indeed be written in the form above 
for an appropriate choice of x and y, {x, y) near px- Since — T31 — ((73, A^i), we 
must have 53 orthogonal both to wi and Awi and therefore q^ ^ wi x Awi ^ W3 = 
{xy,-2y,2)*. Similarly, q2 ^ W2 = W3 x wi = {2x{l + 2y^),2{2 + x^y^),y{A- x^))* . 
Thus, we search for an LU-positive orthogonal matrix Q = EWN where E ^ £, W 
has columns Wi and A'^ is a positive diagonal matrix. It is now easy to verify that 
A^ = diag(l/ri, l/(rir2), l/r2) where ri and r2 are given in the statement; also, 
by computing signs of pricipal minors, WN is LC/-positive and therefore E = T 
Expansion of the product Q*AQ obtains the formula for (jj. 

To show that (j) is an immersion near Tx, it suffices to verify that the Jacobian 
Dcj) at Px is injective. This is easily seen by computing partial derivatives of the 
entries Tn and T23 of T = ^(x, y) with respect to x and y. ■ 



Actually, (f)is a. diffeomorphism from M^ to its image [5] , but this will not be used 
in this text. A reason for choosing such a parametrization for the first column of Q 
will be clarified by the next proposition: an s-step admits a simple representation 
in {x, y)-coordinates. 

Proposition 2.2 There are open neighborhoods S2 C (—1, 1) of s — 0, U2 C Ui of 
p = px and IA.2 = i>{li-i) C U\ with the following properties. 

(a) The function F : (^2 \ {0}) x 1^2 —> Ui is smooth and extends continuously 
(but not smoothly) to S2 x i^2- 

(b) Let F : S2 ^ U2 —>^ Ui be F expressed in {x,y)- coordinates, i.e., F{s,p) — 
(j)-\¥{s,cl>{p))). Then, for p = {x,y) e U2, 

Fis, x,y) = ( x, -——y 



Proof: By definition, F(s,T) = (Q(T-s/))*TQ(T-s/). For T ^ Tx (i.e., T near 
Tx) and s « 0, s 7^ 0, we have T - si ^ Tx- The first two columns of Q(r - si) 
can be obtained from the corresponding columns oi T — si by the Gram-Schmidt 
algorithm and therefore 

/O 1 

Q(T - s/) w 1 

\0 -sign(det(r-s/))y 

Whatever the sign of det(T - s/), F(s, T) w Tx so that 

lim F(s,T)^Tx. 

This proves that there exist open neighborhoods S2 and U2 such that F : (^2 \ 
{0}) xtl2 ^Ui is well defined and extends continuously to the point (0, Tx) with 
F(0, Tx) = Tx. The smoothness of F in (S'2\{0}) xZ/^2 follows from the smoothness 
of Q in the set of invertible matrices. The fact that F extends continuously to 
5*2 X 1^2 will follow from the explicit formula in item (b). 

As in the proof of Proposition 12. 1[ for {s,To) € 6*2 x U2, write Tq — si — 
Qq{A — sI)Qo, where Qo is LC/-positive. Notice that if (ao,6o,co)* is the first 
column of Qq and Tq — (t>{xQ,yQ) then xq ~ — 25o/ao a-nd yo = co/{2bo). For s 7^ 0, 
let ixi,yi) = G(s, a;o,yo) and Ti = F(s,ro) = (f){xi,yi) = QJAQi where Qi is 
LC/-positive. Let (ai,5i,ci) be the first column of Qi: we have xi — —2bi/ai and 
2/1 — ci/{2bi). We must therefore compute ai,6i,ci. 

By definition, 

Ti = (Q(To - s/))*ToQ(To - si) = (Q(ro - sI))*Q*oAQqQ{To - si), 

so that Qi = EQqCKTq — si) for some signature matrix E — diag(ei, 62, 63) G £. 
Set Qi == EQ{Qo{To - si)) = EQ{{A - sI)Qo). Assuming \s\ < 1, we have the 
positive collinearity (ai, 61, ci)* ^ (ei(l — s)ao, e2(— 1 — s)bo, e3(— s)co)* and thus 

62 1 + s 63 s 

Xl = :; Xo, yi = — -- — Vo- 

ei 1 — S 62 1 + s 

Now, sign(xi) = sign((Ti)i2), i = 0,1, and sign((Ti)i2) = sign((To)i2) (from 
Proposition 12.11 and equation ^) and therefore sign(a:i) — sign(a;o); similarly 
sign(2/i) = sign(j/o), completing the proof. ■ 



The computations above fit into a larger context, which we now outhne. As 
with s-steps, many eigenvalue algorithms act on n x n Jacobi matrices (as in [5], 
U and |12j . real symmetric tridiagonal matrices T with Ti+i^i > 0, i = 1, 2, . . . , n — 
1) with non- Jacobi limit points. Recall that Jacobi matrices have simple (real) 
eigenvalues Xi and that its normalized eigenvectors vi can be chosen so that the 
first coordinates q are positive; notice that X^^f = 1- The eigenvalues \i and the 
norming constants Ci form a standard set of coordinates for Jacobi matrices. In 
these standard coordinates ([3]), the s-step acting on a Jacobi matrix T is given by 

(A.C.), - (^-E^(|A.-.|cO^)' ^^l,...,n 

provided T — si is invertible. Such coordinates require some modification in order to 
extend beyond the set of Jacobi matrices. The bidiagonal coordinates in [5] are, up 
to multiplicative constants, quotients Ci^i/ci which admit a simpler evolution under 
s-steps. The map (j){x,y) retrieves a matrix from one among 3! possible choices of 
bidiagonal coordinates. In general, each permutation tt of the eigenvalues obtains 
a map (j).„ : M" -^ 7a and this family of maps is an atlas for T\ . 

3 Sub-eigenvalues 

For a matrix T £ T, let T be its bottom 2x2 diagonal block. Given T £ T\, 
the eigenvalues u)-{T) < ci;+(r) of T arc the sub- eigenvalues of T. Notice that 

Consider the circles c^ , c^, C Ta through Tx parametrized by 





respectively. Clearly, T e Ch if and only if 723 — T33 — 0; also, T e c^, if and only 
if Til = T22 = T33 = 0. 

Proposition 3.1 There is an open connected neighborhood Uy, C 1^2 of Tx with 
the following properties. 

(a) For all T G U^ the following interlacing inequalities hold: 

-1 < u>-{T) < < ijJ+{T) < 1, a;_(r) < T33 < uj+{T). 

(b) The only matrix T G Z//3 for which <jJ^{T) = ijJ^{T) is T = Tx- The functions 
(jj± : U^ — > K are continuous and smooth on Vi^ \ {T;f}- 

(c) A matrix T € Vi^ belongs to c/,, if and only if at least one sub-eigenvalue of T 
coincides with an eigenvalue of T . In particular, T12 > for all T G U^. 

(d) A matrix T G Z^3 belongs to c„ if and only if 133 is equidistant from the sub- 
eigenvalues of T . 

(e) For all TeUa, uj±{T) G ^2. 

The set S2 mentioned in item (e) was defined in Proposition [221 

Proof: The first inequality is the interlacing of the eigenvalues of T and T, the 
second is the interlacing of those of T and T33. 

From item (a), if the sub-eigenvalues are equal then a;_(T) = i^+{T) = T33 = 0. 
Thus, T = and since the trace of T equals 0, one has Tu = and T12 = ±1: only 



the positive choice, which gives rise to Tx itself, is relevant. Continuity in lAy, and 
smoothness in W3 \ {T';f } of the functions (jJ± are now easy. 

For item (c), suppose that det(T- a;+(r)/) = (the case det(T-u;_(T)/) = 
is similar). By construction, u;-|_(T) must be a common eigenvalue of T and T. 
Expand the characteristic polynomial of T along the first row to obtain 

det(A/ - T) = (A - Til) det(A/ -f) + {-T^^^ + Ti^a^as)- 

A common eigenvalue annihilates the two determinants, thus Ti^2(^33~'^+(2^)) = 0- 
Since T12 ~ 1 and ijj+{T) equals an eigenvalue of A we have u)+{T) = T33 — 0. Now 
detT = 0, which implies T23 = 0. Notice that T12 = 0, r23 7^ implies that some 
sub-eigenvalue equals ±1: this possibility was excluded above. 

For item (d), if T33 is equidistant from the sub-eigenvalues, T e 7a must be of 
the form 

'-2d b 0^ 

b d c 
c dj 

Now det(r - A) = A3 - A = A3 + {-M^ - b^ ~ c^)X - 2dc^ + db^ + 2d^, so d{b^ - 
2c2 -f 2d2) = and 52 + c^ + 3^2 = 1. If d = 0, T belongs to c„: notice that Tx 
corresponds to = in the parametrization of c^,. If d 7^ 0, T lies in one of two 
curves in 7a which may be assumed to be disjoint from 1^3. 

Continuity of the functions u)± allows for a choice of W3 satisfying the final 
condition. ■ 

Let U3 = 0^^(Z^3). From item (c) above, {x,y) € U3 implies x > 0. Let 
r/j C M^ be the horizontal axis and r^ C M^ be the vertical line x — 2. Let 
i^3,+ = ^3 n {{x,y) \x<2}, U3^_ = Z^3 n {{x,y) \x>2}, U^^± = 0(Z^3,±)- 

It is convenient to work with a domain for [x, y) coordinates which is more 
explicit than the set lA^. 

Definition 3.2 LetTZa = [2 — a,2 + a]x [—a/10, a/10] C U3 be a rectangle centered 
in px — (2,0). The rectangle TZa is split by r^ in two closed rectangles 'R-a,+ = 

TZa n Z^3,+ and TZa,- =nan U3^- . 

From [8] , it follows easily that U3 can be taken to contain 7?.i/io: using this result, 
as we shall see, all the subsequent constructions are compatible with a = 1/10. To 
make this paper self-contained the working hypothesis is only a < 1/10, TZa C ZY3. 

We rephrase Proposition 13.11 in (a;, y)- variables. Write uj±{x,y) — u)±{(f>{x,y)) 
so that the functions lu± are continuous in TZa with a non-smooth point px- Set 

TZa - (piTZa). 

Proposition 3.3 The diffeomorphism (j) : TZa -^ TZa C 14^ yields bijections from 
fh n TZa to Cfi n TZa o,nd from r^ n TZa to Cy n TZa- The functions w±(x, y) are even 
with respect to y, i.e, uJ±{x,y) — LL>±{x,—y). For points {x,y) G TZa,+ (resp. TZa,-), 
{4>{x,y))3.3. is to the right (resp. left) of {uj+{x,y) + uj-{x,y))/2. 

Signs in the notation TZa,± indicate which among uj± is closer to {'t'{x,y))33'- 
unfortunately, they are the reverse of what their position might suggest. 

Proof: We already saw in Proposition[2lT]that sign((0(a;,y))23) — sign{y). Clearly, 
(j}{2, (tan 9)1^/2) equals the matrix used to parametrize c„ in the statement of Propo- 
sition [231 Evenness of u)±{x,y) in y is immediate from the explicit form of (j) in 
Proposition [SHI Finally, to decide which sub-eigenvalue of T = (j>{x,y) is closer to 
r33, compare trT = T22 +T33 ^uj+{x,y) +uj-{x,y) with 2T33: 



(2r33 - tvT)rlrl = {x - 2){x + 2){x^y^ + Sx^y'^ -4 + 4y 



2\ 



For X slightly smaller (resp. larger) than 2, the expression is positive (resp. negative) 
and thus T33 is to the right (resp. left) of {uj+{x,y) + uj-{x,y))/2. Since the only 
points where T33 = (a;+(x, y) +uj-{x, y))/2 are those in ryOTZa and TZa is connected 
the result follows. ■ 

We need more precise estimates for the sub-eigenvalues lu± near px = (2, 0). 
Definition 3.4 The wedge Xq is {(x^y) e TZa \ \y\ > \x - 2|/10}; set Xo^± = 

Xq n TZa,±- 

Figure [1] contains some of the geometric objects defined in this section; the 
triangles I?o.± will be defined in the next section. 



2-a 




ry (x = 2) 



Th (y = 0) 



Figure 1: The rectangle TZa] in TZa.+ j ^ = ^+ and in TZa.-, lo = lo- 



Lemma 3.5 Near the point px = (2, 0) the functions lo± have a cone-like behavior: 



{x-2)± V(.T - 2)2 + 32./ 
Lj± = h 0{d ) 



where d^ = {x - 2f + ?/. /„ ■jZa,±, / < \uJ±{x,y)\ < 2\y\; in X^.±, |y|/5 < 
W{x,v)\ <2|y|. 



Proof: As in Proposition 12.11 set rf = 4 

r\u? + (4 - x^)u: - \x^y^ = 0, 



A.x^y^ . The sub-eigenvalues solve 



LO± 



-4-|-a;2±\/A 
2^1 ' 



(3) 



where A = A(x,2/) = ((x + 2f -^ %x^y'^){{x - 2f ^ Sx'^y^) > 0, A = 0{d^). In 
particular, A = in TZa only for px — (2, 0). The expression for uj± in the statement 
of the lemma follows directly from 



A - 16((a; - 2f + 32/) = 0{d^). 

For y > 0, the inequality uj+ < 2\y\ is equivalent to A < (4r^y + 4 
in turn equivalent to 

8rly{4: - x'^ + 8y + Sa;^/) > 0, 



which is 



which is clearly true for < a; < 2. The case y < follows from the fact that w^ is 
even in y. A similar factorization holds for u-. Also, we have uj+{x,y)uj-{x,y) = 
—Ax'^y'^/rf and |w_(x,y)|, \uj+{x,y)\ < 1. Since a < 1/10 we have that Ax'^/rf < 1 
for {x,y) G TZa- The estimate for u!± in Ab.i is now somewhat cumbersome but 
straightforward. B 



We also need estimates for partial derivatives of lo± in terms of x and y. 



Lemma 3.6 For all (x.y) G TZa — {Px} i^^ partial derivatives {lo±)x and {Lo±)y 
satisfy < {uj±)x < 1 and \{uj±)y\ < 7/3. The equality {uj+)x — (resp. {ll'^)^ — 0) 
holds exactly for y = 0, x < 2 (resp. x > 2); for y ^ 0, ±y{LU±)y > 0. 

Proof: The partial derivatives of lu± are 

8a; 



(w±)2 



{^±)i 



rfy/A 
4x^y 



(1 + 2y2) Va] ± (_4 + x^+ 8y2 + 6x^y^ + IGx^y^) ) , 



(({A - x^)y/A] ± (16 + 24a;2 + x^ + 'i2x^y^ + Bx'*?/^)^ ^ 



which are well defined provided A 7^ 0, i.e., outside of p^:- = (2,0). Also, 

{(\ + 2y2) VAJ - (-4 + a:^ + %y^ + %x^y^ + IGx^y'')^ = %y^r\ > 

whence (1 + 2y'^)y/A > |-4 + a:^ + Sy^ + Gx'^y'^ + 16x'^y^\ and {uj±)x > 0; equality 
implies y = 0. Also, {uj±)x < 16a;(l + 2y2)/rf < 1. 
In the rectangle TZa, 



115 < 16 + 24a;^ + a;'' + 32a;^2;^ + Sx^^y^ < 142, 



(4-x^)\/A < 1/4 



and the signs of {Lo±)y are settled. Since 



< 



1 



y 



< 



1 



(a; + 2)2 + 8x2y2 8a;2y2 36O 



we also have 



l(^±).l<^ 



4a;^ 



16 + 24a;2 + a;^ + 32a;2y2 + 8a;'*y2 



(4-x2)v^) 7 
< - 



4 Wilkinson's step 

Take u)(T) to be the sub-eigenvalue of T closer to T33; in case of a draw, we arbitrar- 
ily choose uj{T) = u}+{T). Wilkinson's step is the map W(r) = F(a;(T),T). From 
now on we shall work in (a;, y) coordinates, i.e., with W{x, y) = (j)^^{W{4>{x, y))) = 
F(.<^{x,y),x,y) where uj{x,y) = u}{(l){x,y)). Write W±{x,y) = F{uj±{x,y),x,y). 
From Propositions 12.21 and 13.11 the maps W± : TZa -^ Ui are continuous and 
W : TZa -^ 1^1 is well defined with step discontinuities along r„ n TZa ■ From Propo- 
sition [^il] and the fact that LU-{x,y) < < uj^{x,y), 

W.ix,y) ^ iX^ix,y),Y,ix,y)) = (i±i^ ^^ ,'T^^7'\ v) (4) 

\l-uj±{x,y) l + uj±{x,y) J 

and therefore W± are smooth functions in TZa \ {pat}- Evenness of a;± with respect 
to y (Proposition Eini) yields X±{x,y) = X±{x, -y), Y±{x,y) = -Y±{x, -y). 

The rectangles TZa = [2 — a, 2 + a] x [—a/10, a/10] for a < 1/10 are invariant 
under W; furthermore, the maps W± are injective on TZa.±- Figure [2] provides 
strong evidence to these facts, proved in Propositions 14.11 and 14.31 




WiUa) 



Figure 2: W+{TZa,+), W-{TZa-) &ndW{TZa), a = 1/10; vertical scales are stretched. 

Proposition 4.1 Take a < 1/10 satisfying also TZa C U^. Then W{TZa) C TZa- 
Moreover, \Y{x,y)\ < |y|/49 for all {x,y) G TZa, X^{x,y) > x for all {x,y) £ TZa,+ 
and X^{x,y) < x for all {x,y) G 7^q,_. Also, W±{{2^a} x [-a/10, a/10]) C TZa,±, 
W±{r,) dTZa,^. 

Proof: From Lemma 13.51 we have \uj±{x,y)\ < 2a/10 < 1/50 and therefore 
\Y{x,y)\ < |y|/49. We now prove that W+(JZa^+) C TZa- Clearly, {x,y) G 7^a,+ 
implies |y(a;,y)| < a/10 so we must prove that 2 — a < X-^-{x,y) < 2 + a. From 
equation ^, X+(x, y) > x with equality exactly when y = 0. Since {uj+)x{x, y) > 
we have Xx{x, y) > and it therefore suffices to prove the two claims in the state- 
ment, i.e., ^+(2 ~ a,y) < 2 and X-^-{2,y) < 2 + a. For a < 1/10, the inequalities 
follow from uj+{x,y) < 2a/10 < a/(4 + a) < a/(4 — a). Similar checks apply to 
W-. U 

Each rectangle TZa.± is not invariant. Denote the interior of a set X C M^ 
by int(X). Given b > 0, b < a, let I?6.+ (resp. I?6.-) be the triangle defined 
by2-a<x<2-fe- 10|y| (resp. 2 + b + 10\y\ < x < 2 + a). Notice that 
int(Ao) — TZa \ (^0,+ U T>o,-)- Let 3^o C Xq be the thinner open wedge defined by 
|j/|>10|2;-2|. 



kivifol 



i:mTw& 



-^4. 



b ''■ 



yo 



■Rj 



Th 2 + a 



i^ 



M- 



Figure 3: {xQ,ya) G Vb,+ implies (a;i,yi) = W+{xo,yo) G T>b,^ 



Proposition 4.2 The triangles T>b,± are invariant, i.e., Wj^iT>h,+) C T>b^+ and 
W-{T>b,-) C T>b,-- If P 7^ (2 ± 5, 0), (2 ± a, 0) is a boundary point of T>b.± then 
VF±(p)Gint(I?fa,±). 
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Finally, W+{yo) C int(X>o,-) and W-iyo) C int(Po,+)- 

This will imply f Proposition 15 . 1) 1 that points in I?o.± and 3^o have cubic conver- 
gence to points in r^ different from px ■ 

Proof: We prove that H^+(I?b^+) C I?f,.+ by computing the slope a of the segment 
joining {xo,yo) and 

, ^ jrr f ^ fl + Lj+{xo,yo) uj+{xo,yo) 

(xi,yi) = W+{xo,yo) = -; j r xq,—^ j r yo 

\l-uj+{xo,yo) l + w+(xo,yo) 

given by 

-yo 1 - w+ 



LU+ 2xo(l + ^+) 

where uj+ stands for (jj+(a;o, j/o) (see Figure[3l). By Lemma [3751 \uj±\ < 2\y\: simple 
algebra then shows that, for a < 1/10, we have |a| > 1/10 and the segment is 
steeper than the non- vertical sides of 2?b,+. Since yo and j/i have the same sign, 
invariance of !?(,.+ follows. The argument for I'b,- is similar. 

The verification that W±{yo) C int(I?o,T) ^ses estimates of the form \y\/2 < 
l^±(p)l < 1/20 in the closure of 3^o; details are left to the reader. ■ 

Proposition 4.3 Each map W± : mt{TZa.±)"^fh ~^ T^a is an orientation preserving 
diffeomorphism to its image. 

Proof: We consider the Jacobian matrix 



DW±{x,y) 



2 



(l-(^±))2 l-iu;±) (l-(^±)) 



(5) 



V (l + (c^±))2^ (l + (^±))^^ l + i^±)/ 



From Lemma [3.61 {X±)x > and {Y±)y > with equality precisely for y = 0. 
Similarly, for y ^ 0, sign{{X±)y) = sign((Kt)j;) == ±sign{y). 

To prove that W± are local diffeomorphisms in int(7?.a^±) \ r^, write 

det i^W^±(x, y) = ±-^ /2{u±Uv±x ^ ^^^^^^^ + ^^(i + ^^)'\ . 

l~ iOj. \ i — UJ± J 



From Lemma [3. 6[ all terms in the sum in parenthesis have the same sign and thus 
A&XDW±(x,y) > if J/ 7^0. 

We now prove the injectivity of Wj^ on int(7?.a,+) \ r^. From symmetry, it 
suffices to prove the injectivity of W+ on 7?.a,++ = {{^^v) G int(7?.a,+) | y > 0}. 
In other words, given (a;i,j/i) S 7?.a, y\ > 0, we must prove that there exists at 
most one point {x,y) G 'R-a,++ with W+{x,y) — {xi,yi). Let 7 : [0,1] -^ R^ be 
the piecewise affine counterclockwise parametrization of the boundary of TZa.++ 
with 7(0) = 7(1) = (2 - a,0), 7(1/4) = (2,0), 7(1/2) - (2, a/10) and 7(3/4) = 
(2 — a, a/10). Since detDW+(a;,y) > for y > and yi > 0, {xi,yi) is a regular 
value of W+ and, assuming {xi,yi) ^ {W+ o 7)([0, 1]), the number of solutions of 
W+{x,y) = {xi,yi) is given by the winding number Ci of W+ o 7 around {xi,yi). 
Recall that a simple way to compute 2ci is to count with signs the intersections 
of W+ o 7 with the vertical line through (xi, j/i). From the signs of the entries of 
DW+, the X coordinate of {W+o'y)(t) is strictly increasing from t = to t = 1/2 and 
strictly decreasing from t = 1/2 to t = 1. Thus, there are at most two intersection 
points and |ci| < 1 implying injectivity. If (xi,yi) G {W+ o7)([0, 1]), the argument 
above applies to nearby points and the result follows by continuation outside r^. 
The proof of the analogous statement for W- is similar. ■ 
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Actually, the maps W± : 'JZa,± — > Ti-a are homeomorphisnis to their respective 
images; we omit details. 

We conclude the section with some estimates on DW± which will be used in the 
last section. A vector v — (vi, V2) is near-horizontal if wi > and |w2|/fi < 1/25. 

Lemma 4.4 Take p G TZa, P 7^ Px- Let v — (fi,W2) be a near-horizontal vector. 
Then v — DW±(j))v = (wi, V2) is also near-horizontal and < fi/2 < wi < lOwi. 

Proof: From the expressions of the entries of 

^mii TO12 



DW± 

\m2i m22 

in equation ([5]) above and the estimates in Lemma 13.51 and Lemma [ 

0.96 < TOii < 5.5, |mi2| < 10.5, |m2i| < 0.0105, |m22| < 0.045. 

The claims now follow from easy computations. I 

5 Convergence rates of Wilkinson's shift strategy 

We now consider the asymptotic behavior of Wilkinson's shift strategy. Given a 
W-orbit (Tk), Tk+i = W(Tfc), we study the decay of the entry {Tk)23- In {x,y) 
coordinates, the W^-orbits (pk) are defined by Pk-\-i = W{pk), Po & Ti-a] the rele- 
vant issue is the convergence rate of the y coordinate since, from Proposition 12.11 
{4>{x, y))32/y — 2ri/r| > is bounded and bounded away from in TZa- 

A VF-orbit (pk) has strictly quadratic (resp. cubic) convergence if there exist 
constants c, C > such that, for all k E N, 

c\ykr<\yk+i\<C\ykr 
for r = 2 (resp. r = 3) (here pk = (xk,yk))- 

Proposition 5.1 Let p G TZa \ Th, Pk = W^{p). If Pk & T^o,± for some k then 
the W-orbit {jpk) has strictly cubic convergence. Otherwise pk G Xq for all k and 
convergence is strictly quadratic. 

Proof: The case y^ = is trivial. Assume without loss that pk G T>o,+- From 
Proposition 14. 2i there exists b > such that Pk+i G T^b,+ and therefore pj £ T>b,+ 
for all j > k. From Proposition lS.ll uj^ is an even, smooth function in the y- variable 
in T^b,+ - From compactness and Lemma [3.5) given & > there exists C > such 
that for all {x,y) G T>b,-\- we have y^ < uj+{x,y) < Cy"^. From equation ([4]), 

\y\y2<\Y+{x,y)\^-^\y\<2C\y\' 

for {x,y) e T)b.+ i yielding strictly cubic convergence. 

We now consider orbits in the wedge X^ — A'o.+ U A'o.-- From Lemma |3.5[ 
|j/|/5 < |w-j-(a;,y)| < 2|y| for (a;,y) € Xq±: strictly quadratic convergence now 
follows from 



HVlO < \Y+{x,y)\ =. 2+t[l]y) ^y^ " ^'^''' 



completing the proof. 
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Notice that the constant C in the proof depends on b and therefore the rate of 
cubic convergence is not uniform in Pq.+ i consistent with strictly quadratic conver- 
gence for orbits in Xq. 

Corollary 5.2 Given p E TZa \ r^, consider the W-orbit pk = W'^{p). The follow- 
ing conditions are equivalent: 

(a) limfc^ooPfc ^ Px; 

(b) Pk G '^0 for all k; 

(c) the W-orbit (pk) has strictly quadratic convergence. 

Proof: The estimate |y(x,y)| < |2/|/50 guarantees convergence to some point poa 
in r/j . Orbits contained in Xq must then converge to px ■ Conversely, if Poo 7^ Px 
then Poo € int(I?b,±) for some & > and pk G 'Db,± for sufficiently large k. ■ 

Let X C TZa be the set of points p for which lirnp^. — px- From Proposition l4.21 
A' C A'o \ 3^0 and therefore r„ n A" = r/j n A" = {px}- We still need to prove that 
X 7^ {pa"}: this and more will be done in this section. 

Figure |4] shows X extended to a rectangle much larger than T^-i /iq: numerical ev- 
idence indicates that even in such larger regions the qualitative descriptions remain 
valid. 

A compact set ii' C K is a Cantor set if K has empty interior and no isolated 
points. As we shall prove in Theorem 16. 11 horizontal sections of X are Cantor sets. 
The set X is the union of uncountably many arcs, disjoint except at px- Each arc 
intersects a horizontal line in a single point. 

The Hausdorff dimension of the middle-third Cantor is log 2/ log 3 « 0.63 ([5] 
and [H] contain a thorough discussion of Hausdorff dimension) . More generally, self- 
similar Cantor sets have positive Hausdorff dimension. The horizontal sections of X 
are much thinner: they have Hausdorff dimension 0. That is why the fine structure 
is invisible in this figure, unlike most figures of Cantor sets in the quoted books. 

Numerical evidence also indicates that the northwest-southeast leg of set X is 
the union of a family of analytic curves, tangent (not crossing) at px'- we shall not 
pursue this matter further. 




(3,0) 



Figure 4: The set X near px] in scale. 

We provide another description of X as the intersection of a nested sequence of 
compact sets Xn. From Corollarv l5.2[ p = po G X ii and only ii pk — W'^{p) £ Xq 
for all k. Define Xn C TZa to be the set of points p G TZa such that pk E Xq for 
< fc < n. Thus Xo = Xo, X„+i = W'^Xn) C X^ and X = fln-^™- Figure 
[5] indicates the first few sets. As the diagram suggests, the interior of Xn has 
2"+! connected components which we now describe. In a sense, the Cantor sets on 
horizontal lines are limits of this successive doubling of components. 
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yi-+)yi — ) 




Figure 5: The upper halves of the sets Aq, X^ and X^ \ schematic. 



Decompose the sets Xn by tracking on which side of r^ the points pk he. A sign 
sequence of length n is a function r : {0, . . . , n — 1} —t {+, — } or, equivalently, a 
string of n signs (t(0), t(1), . . . , T{n — 1)). Define 

A"^ = {p e 7^a I pfe e A'o^T-(fc), <k<n;pnG Xq}. 

Figure [6] shows a simple example: the upper half of X^ is a curvilinear triangle 
with base contained in the top side of TZa,+ and vertex in px] the upper half of 
X^~'' is a similar triangle in TZa-- Also, A"!^"^^ = ^+^(^"0) n 7^a,+ C A^. 



W^\A) W^+^(S) M/^+^C) 




PA' 



Figure 6: The upper halves of Xq W+{TZa,+) and X{~^' — W_f_^{Xo); schematic. 

We also consider infinite sign sequences, i.e., functions ct : N — > {+, — }, and the 
corresponding sets 

A"^= f] Xn^ '"•'■--''' . 

neN 

Let S be the set of infinite sign sequences: S admits the natural metric d{ai, 02) — 
2~" where n is the smallest number for which CTi(n) 7^ cr2(n). The metric space E 
is homeomorphic to the middle-third Cantor set contained in [0, 1]. The sets X'^ 
are compact, the intersection of two distinct such sets is {px} and the union of all 
X'^ is X . We conclude this section by proving that the intersection of each X'^ with 
a horizontal line is not empty. 

Let i± be the sides y — ±a^, —a < a; < a, of TZa- 

Proposition 5.3 Let 7/1 : [0, 1] — > TZa \ {px} be a parametrized curve with 7/!.(0) G 
I?o,+; 7/1(1) S T>Q,-- Let a be an infinite sign sequence: there exists t e [0, 1] such 
that-fh{t) e X". 

Notice that we do not claim that such t is unique: this requires stronger hypoth- 
esis and will be discussed in the next section. 

Proof: We first prove by induction on n that the connected component containing 
Px of each set X^ has elements on the sides £± (here r is a sign sequence of length 



14 



n) . The case n = is trivial and the case n = 1 has already been discussed (Figure 
[S]) . Assume the connected component of A','[ containing px to have a point in £^ . 
Consider a path jy : [0,1] — > X^ with 7„(0) = px, 7u(l) G i+. The image of 7^, 
must intersect the curve W+{£+): let to be the smallest t in this intersection. The 
image of the restriction of 71, to [0,to] is contained in W+{TZa,+)- Since 1V_|- is a 
homeomorphism on its image (Proposition 14. 3|) . there exists 71,+ : [0,io] ^ '7^a.+ 
with W+{-fv+{t)) = lv{t) for all t. Also, the image of 7^+ is contained in X^^'[' . A 
similar construction works for £_ and for X^^^'j'' , completing the proof of the claim. 
Consider an infinite sign sequence a and its restrictions r„ — cr||o,i n-i}- For 
each n let 

The sets Kn are nested, compact and nonempty and therefore their intersection is 
also nonempty. Any t in the intersection satisfies 7ft (i) G X"' . ■ 

As we shall see in the last section, the sets X^ and X"^ are connected but the 
proof of this fact requires more careful estimates. 

6 Geometry of JY 

The main result of this section. Theorem 16. 1[ is that X is very thin, almost as thin 
as a finite union of curves. More precisely, X has Hausdorff dimension 1. 

Denote the length of an interval / by /x(/). Given an infinite sign sequence 
a, define cr" by a^{k) = a{k + 1) (this is the standard shift operator in symbolic 
dynamics). Similarly, for a sign sequence r of length n+1 let r'' be the sign sequence 
of length n defined by T^{k) = T{k + 1). 

Theorem 6.1 For any infinite sign sequence a, the set X" is a curve X"^ — 
{(g^iy), y),y e [-a/10, a/10]} where g" : [-a/10, a/10] ^ [2 - a, 2 + a] is a Lips- 
chitz function. Wilkinson's step takes curves to curves: W{X'^) C X'^ . The set X 
has Hausdorff dimension 1 and the intersection of X with any horizontal line is a 
Cantor set of Hausdorff dimension 0. 

We need a few preliminary definitions. A near-horizontal curve is a C^ function 
-f : I ^ TZa \ {px} such that, for all t in the interval /, the tangent vector ■^'{t) is 
near-horizontal (as defined at the end of Section 4). A near- horizontal curve 7 is 
standard if the first coordinate of 7(t) is x = t. For any near- horizontal curve 7 there 
exists a unique strictly increasing C^ function a, the standard reparametrization of 
7, for which 7 = 7 o a is standard. The height of a near-horizontal curve 7 is 
y^ = 7(2) so that 7 crosses r„ at (2, y*). 

Near-horizontality is preserved by Wilkinson's step: from Lemma |4.4[ if 7 : / — > 
T^a,+ \ {px} (resp. TZa,- \ {pa'}) is a near-horizontal curve then so is W+ 07 (resp. 
W- o 7). This process squeezes near- horizontal curves towards the line r^. The 
constant 1/25 in the definition is somewhat arbitrary but it can not be replaced by 
very small numbers since W does not take horizontal lines to horizontal lines. 

Recall that 3^o C Xq is an open wedge with the property that p G ya implies 
W±{p) € T^o,T- F'o'^ ^ ^igii sequence t of length n, define 

X = {p^TZal Pk e '%'o,r(fc), <k<n;pne yo}. 

In particular, the orbit {pk) with po G y^ escapes Xq starting from k = n+1. Also, 
y^ is an open subset of X^ disjoint from A'„+i. 
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P;t = (2,0) ti 



Figure 7: A near- horizontal curve (not in scale). 

Lemma 6.2 Let 7 : [io,ii] — *■ Ti-a \ {px} be a standard near-horizontal curve. 
Assume that 7(^0) G 2?o,+; 7(^1) G 2?o,-- -^e^ J/* = 7(2) be the height of ^. Then, 
for^{t)^{t,y)eXa, 

\y,\/3<\y\<3\y,\. 

Let T be a sign sequence of length n. The sets 

i: ^{te [to, ii] I 7(t) e K}, Jn = {t^ [to, h] \ m e 3^,1} 

are intervals and their lengths satisfy 



IQu+l 



2" 



<t,{j:)<fi{i:)<r\4Qy,\^ <2 



(-2<"-i)) 



Proof: As a basis for an inductive proof, we first consider the case n = 0. Set 

Io = {te [to,ti] I 7(t) e Xa}, Jo = {te [ta,ti] \ 7(i) e J^o}. 

Draw lines through the point (2,?/,) with linear coefficients ±1/25, as in Figure [71 
and compute their intersections with the diagonals of TZa- Since these diagonals 
are steeper than 7, the intersection of the image of 7 with Xq is contained in the 
shaded triangles. An elementary geometric argument proves the first claim, verifies 
that Iq and Jq are intervals and obtains the estimates for their lengths. 




Figure 8: A near- horizontal curve (not in scale). 

We now do the induction step. For a sign sequence r of length n + 1 with 
t(0) = -f- (the other case is similar), consider 

i:+i = {t(^[to,h]\^it)ex:+,} 
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and 7+, the restriction of 7 to [to, 2]. Set 7" = W+ o 74. (see Figure [5]): as remarked 
above, 7" is a near-horizontal curve. Let a" : [ip'^'i] ~^ [^o,2] be the standard 
reparametrization of 7' so that 7' — ^'^ o a' is a standard near-horizontal curve 
with height yt = j*{2). Notice that 7«(4) e 2?o,+ and 7f(t}) e Po,- so, by the 
induction hypothesis, the sets 

In - {i e [tli] I 7»(t) e A-;"}, j:' ={te [44] | fW e x"} 

are intervals whose lengths ij,ij satisfy 



1 



IQn+l 



< tj < 4 < 2" 1 402/5 



(6) 



Clearly, t G Z|^ (resp. J^^ ) if and only if a^{t) £ Inj^i (resp. J^+i), proving that 



1^,1 and J^+i are intervals; let ^/,-^j be their lengths. By Lemma [4.41 1/10 < 
{(^)'{t) < 2 for all t £ In and therefore 



{1/I0)e^j < Ij <li< 2l\. 



(7) 



Let 7(a*(2)) = {x,y) so that W+{x,y) — (2, j/J). From the first claim, |y*|/3 < 
[y[ < 3|y,|. By Lemma \3l5\ {l/5)[y[ < uj+{x,y) < 2[y[. Thus, by equation [H 
(l/10)y^ < I2/II < 3y^. Combining these estimates, 



y*_ 

90 



> 



|40y«| < [40y, 



(8) 



The required estimate now follows from estimates [H [71 [Hjand the fact that |y*| < 
a/10 < 1/100. ■ 

Proof of Theorem I6.lt We organize the proof by stating a few claims, most 
of which in terms of a standard near-horizontal curve 7; notation will be borrowed 
from Lemma 16.21 For a sign sequence r of length n, consider the sign sequences 
(r, +) and (r, — ) of length n + 1. 



Ar,+) 






are contained in different connected components 



(a) The intervals I\j^i 

Ofll-^j^n-^ 

This claim is proved by induction. The case n = follows from the inclusions 
X^ C Ao^± \3^o- The induction step employs the same construction used in Lemma 
16.21 for T"" = W'^ o 7 o a'' , the standard reparametrization a' maps the intervals 
constructed from t' to the corresponding intervals for r. 

(b) For an infinite sign sequence a there is a unique f^ £ [to,ti] with ji^) £ X'^ . 
This follows directly from Lemma W^ since 



lim fi{In 



'^1(0, 1. 



-1}^ _ 



) = 0. 



In particular, a horizontal line at height y (which is near-horizontal) meets X" at 
a single point {g'^{y),y)- The Lipschitz estimate \g'^{yi) — 5°^ (2/2)! < 25|?;i — J/2I 
likewise follows by taking lines of slope less than 1/25. 

(c) Let (Ti and a-i he distinct sign sequences and n be the smallest integer for which 
ai{n) 7^ (72 (n). Then 



1 
10"+i 



90 



< It" 



r^l <2"|40y* 



< 



2(-2 



("-1) 
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Let T be the restriction of cri or 0-2 to {0, 1, . . . , n— 1}. Without loss, f^^ e Vi+i 

and f^^ £ In+i ■ Lemma [6.21 and claim (a) obtain the inequalities. 

In particular, the map from the Cantor set T, to K. = {t G [to,ti] \ 'y{t) e X} 
taking a to tg- is a homeomorphism. 

We now construct a very thin open cover of IC. For a positive integer n let 
5+ C S be the set of the 2" sign sequences a for which k > n implies t7(fc) = +. 
Let Br{p) be the ball of radius r and center p. 

(d) Take r > and /ef n he a positive integer such that r > 2^^^ '. Then the 
balls Br(t" ) C M, (T^ e S^ , form an open cover of K. 

From step (b), given t £ JC there is some cr e S for which t = f^. Let cr„ e 5*+ 
be defined by 

,,, /cr(fc), 0<fc<n, 
""(')^\+, k>n 

so that cr„ e S'+. From claim (c), |i'^" - t"^] < 2(-2'""''^ and therefore t e Brit"'-). 
The open cover of /C above is now adapted to yield a thin open cover of X . For 
a positive integer to, let Y^ C [—a/10, a/10] be a set of 277i equally spaced points 
so that given y e [—a/10, a/10] there is yo G Fm with \y — j/„i] < a/(10TO). Also, 
for positive integers n and to., let X„^m C A" be the set of 7712*^"+^^ elements of the 
form(g'^(2/),y), o" G 5'+, y € F^. 

(e) Consider r > anii positive integers n and to, smc/i t/iat r > 2*-^"^ •* anrf 
r > 1/to. T/ien i/ie balls Br{p) C M^, p g Xn,m, form an open cover of X . 

Given p — {g'^{y),y) let j/o be the point of Ym closest to y and let o-„ e 5"+ be 
defined as in claim (d). Let pi = {g'^"{y),y) and po = {9'^'^ {yo) , Vo) ■ Notice that 
Po € Xn^m- In order to prove that p belongs to Bripo) we estimate [p — Pi] and 
1pi— Pol- From claim (d), [p- pi] < r/2. Since a < 1/10, \y—yo\ < a/(10m) < r/100; 
from the Lipschitz estimate after claim (b), \g'^°{y) — g'^°{yo)\ ^ ^/^ and therefore 
\pi-Po\ <r/2. 

(f) The set /C has HausdorjJ dimension 0. 

Given r > 0, let n = [1 + log2(— logj r)] . Claim (d) obtains on open cover of JC 
by 2" < —4 log2 r balls of radius r. For all d > we have 

lim(-41og2r)r'* = 0. 

Thus, /i(i(/C), the Hausdorff measure of dimension d oi IC, equals 0. 

(g) The set X has HausdorjJ dimension 1 . 

Since X contains curves this dimension is at least one. It suffices therefore to 
prove that Hd{X) = for all d > 1. 

Given r > 0, let n = [2 + log2(— log2 r)~\ and to, — \l/r~\ . Claim (e) obtains an 
open cover of X by to2"+^ < — 32(log2 r)r~^ balls of radius r. For all d > 1 we have 

lim-32(log2r)r"V'' = 0. 

r— >0 

This implies fid{X) = 0, proving the claim and completing the proof of the theorem. 



7 Conclusion 

Recall that w+ (T) and u) — (T) are the roots of a quadratic polynomial with smooth 
coefficients (in T). The algorithm to obtain a; given T is thus multivalued: in 
variables x and y, the domains of choice of each sign are rectangles sharing the line 
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ry. In a sense, dynamical systems techniques arose naturally in this paper because 
orbits may switch sides arbitrarily often and according to arbitrary patterns (sign 
sequences). An outstanding example of the use of sign sequences to label orbits are 
kneading sequences (|11|). Many numerical algorithms do not exhibit this kind of 
complexity. 

Unsurprisingly, the set X of atypical points is more complicated than an al- 
gebraic surface. This is why Hausdorff dimension becomes the relevant tool to 
quantify the size of X. Theorem 16.11 shows that X is very thin. 

A matrix is an AP-matrix if its spectrum contains an arithmetic progression 
with three terms and is AP-free otherwise. This paper concentrated on the sim- 
plest examples of AP-matrices, with eigenvalues —1,0 and 1. We conjecture that 
the presence of three term arithmetic progressions in the spectrum (and whether 
the three eigenvalues are consecutive) dictates the possible rates of convergence of 
Wilkinson's shift. In a forthcoming paper we show that for AP-free tridiagonal 
matrices, cubic convergence of Wilkinson's shift indeed holds ([9], [10]'). 
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